1 ATAATGGGAA AGTAO^^G TTATTAAACC CGTAACAACT CCGAGTGTAA

51 TACAATTAAC GCCTGATGAT AGAGTAACGC CTGATGATAA AGGTGAATAT

101 CAACCCGTTG AAAAGCAAAT AGCGGGAGAT ATAATACGTG TACTAGAATT 

151 CAAGCAAACA AATGAAAGTC ATACAGGATT GTATGGAATT GCATATCGAG 

TOX F2 ^ 

201 CTAAGAAAGT AATAATAGCA TATGCTTTAG CGGTAAGTGG TATTCATAAT 

251 GTCTCTCAAC TTCCAGAAGA CTATTATAAA AATAAGGATA ACACAGGTAG

301 AATTTATCAA GAATACATGT CTAATCTTTT ATCTGCACTA TTGGGTGAGA 

351 ATGGTGATCA AATTTCTAAA GATATGGCAA ATGATTTTAC CCAGAACGAA

401 CTGGAGTTTG GAGGTCAACG TCTTAAAAAT ACCTGGGATA TTCCTGATCT

451 TGAGAATAAA CTATTGGAAG ATTATTCAGA TGAAGATAAA TTATTAGCAC

TOX F1

501 TATATTTCTT TGCTTCACAA GAACTTCCAA TGGAGGCAAA TCAACAATCA

► * TOX R3 

551 AATGCAGCAA ATTTTTTTAA AGTAATTGAT TTTTTACTTA TCTTATCTGC

601 TGTAACATCA CTGGGAAAAA GGATTTTTTC AAAAAATTTT TACAATGGTC 

651 TAGAAACTAA ATCATTAGAG AATTATATTG AGAGAAAAAA ACTTTCTAAA

TOX F3 > 

701 CCTTTCTTTC GACCACCGCA GAAGTTACCT GATGGCAGAA CAGGCTACTT 

751 GGCCGGTCCA ACAAAAGCGC CTAAATTGCC AACAACGTCT TCTACAGCAA

* TOX R4 

801 CAACGTCTAC AGCAGCTTCA TCTAATTGGA GAGTTAGTTT GCAAAAACTT

851 AGAGATAACC CATCCAGAAA TACATTTATG AAAATGGATG ATGCTGCAAA

901 ACGAAAATAT AGTTCATTTA TAAAAGAGGT ACAAAAGGGT AATGATCCAC

951 GTGCAGCAGC AGCAAGTATT GGTACAAAAA GCGGCAGTAA CTTCGAAAAA 

1001 CTGCAAGGTA GAGATTTATA TAGTATAAGA CTAAGCCAAG AACACAGGGT

4 A24AC1 
1051 AACATTCTCC ATAAATAATA CTGACCAAAT AATGGAGATC CAAAGTGTTG

1101 GAACTCATTA CCAAAATATA [tA^CCTGATT TATAGTAGTG ATAAGACGTA 

1151 AGATAAATAT GGAAGGTTGT AATTCTATTG CACTTCCTCA GAGGTGACCG 

1201 CTCAG 
1 MVIKPVTTPS VIQLTPDDRV TPDDKGEYQP VEKQIAGDII RVLEFKQTNE

51 SHTGLYGIAY RAKKVIIAYA LAVSGIHNVS QLPEDYYKNK DNTGRIYQEY

101 MSNLLSALLG ENGDQISKDM ANDFTQNELE FGGQRLKNTW DIPDLENKLL

151 EDYSDEDKLL ALYFFASQEL PMEANQQSNA ANFFKVIDFL LILSAVTSLG

201 KRIFSKNFYN GLETKSLENY IERKKLSKPF FRPPQKLPDG RTGYLAGPTK

251 APKLPTTSST ATTSTAASSN WRVSLQKLRD NPSRNTFMKM DDAAKRKYSS

301 FIKEVQKGND PRAAAASIGT KSGSNFEKLQ GRDLYSIRLS QEHRVTFSIN

351 NTDQIMEIQS VGTHYQNI

FIGURE 2
1 AAGCTTGCTA ATAATTCTTG CGTAAGTTAA TTTTACATTG AAATTAACGC
Hind III 

51 TTAAAAAGCC AGGGAAAACT CTATATTTAA AGTTGAAATT TATATTAGTA 

101 GCGACAAATT GCGGAGTTTT CTGCCAGAAA TTTCATAGCT CAAATAAACA 

151 TTAACATAAT GGAGAAATAT J^^GTTATA CAATTAACAC CTGATGATAG

201 AAGTGGATAT CCACCCGTTG AAAAGCAAAT AGCAGGAGAT ATAGTACGTA
Eco RV

251 TACTAAACTT TAAGCAAACA GATGAGGGTC ATACAGCATC ATATGGAATT

301 GAATATCGAG CTAAGAAAAT AATATTAGCT TACGCTTTGG CTGTAAGTGG

* AC4R 

351 TATTCATAAT GTATCTAAAC TTCCTGATGA CTATTATAAG AATAAAGAGA
401 CTGCTGAGAG AATTTATCAA GAATATATGT CTAATCTTTC ATCTGCACTA

451 TTAGGTGAAA ATGGTGATCA AATTTCTAAA GATATGGCAA ATGGTTTTTA

AC2F 

501 TAAGAATGAA CTGGATTTTG AAGGTCAATA TCCTCAAAAC ATTTGGAATG 

551 TTCCTGAGCT TGAAAATAAA CCATTGAGTG CTTATTCAGA TGACGATAAA

4 AC7R 

601 TTATTAGCAC TATATTTTTT CTCTGTACAG GAAATTCCAC TGGAGGAAAA

651 TCAACAATCA AATGCCGCAA GATTTTTTAA ATTAATTGAT TTCTTATTTA

701 CCTTATCTGC TGTAACTTCA CTGGGAAGGA GGATTTTTTC AAAAAACTTT 

751 TACAATGGAT TAGAGGCTAA ATCATTAGAG AATTATATTG AGAGAAAAAA

AC6F h 

801 ACTTTCTAAA CCTTTCTTTC GACCACCGCA GAGATTACCT GATGGCAGAA

851 TAGGTTATTT GGCTGGACCA ACAGAAGCGC CTAAATGGAG AGTGAGTTTT

4 AC5R 

901 AAAGAACTTA AAAATAACAA ATCTAGGAAT GGATTTTCTA ATATGGAAGG

951 GGCTGCAAAA CAAAAGTATA GTTCATTTAT AAAAGAGGTA CAAAAGGGTA

1001 ACGCTCCACA GACAGCAGCG AAAAGTATTG GTACAGCCAG TGGCAGTAAC

1051 CTGGAAAAAT TGCCGAATAA TTTATATAGT GTGAGGCTAA GCCAAAAAGA 

AC3F 



1101 CAGGGTAACC TTTACTCAAA ATGATACTGA CAATACAATG ACGGTTCATA

« 



> ^ AC8R 

1151 GTGTTGGAAC TCATTATAAA AATATATGAT GAGTAATCTC TGACTTCGAT



1201 TGACAGAGCA TTTTTAAGCT CTCATTTTCT CAACGGGAGT CTCATAAGGC 

1251 GTTTTACTTT TCAAGCCACT ATGTGGTCTG TGATAATTGT AAAACGCCTT 

1301 CTTTTAGCCA ATACACTTTA CTACCAAGAA AATATATACC CTATGGATTT

* V16AC1 
1351 CAAGATGGAT CGCGGCGGCA AGGGAGCGAA TCCCCGGG 
1 MVIQLTPDDR SGYPPVEKQI AGDIVRILNF KQTDEGHTAS YGIEYRAKKI

51 ILAYALAVSG IHNVSKLPDD YYKNKETAER IYQEYMSNLS SALLGENGDQ 

101 ISKDMANGFY KNELDFEGQY PQNIWNVPEL ENKPLSAYSD DDKLLALYFF 

151 SVQEIPLEEN QQSNAARFFK LIDFLFTLSA VTSLGRRIFS KNFYNGLEAK 

201 SLENYIERKK LSKPFFRPPQ RLPDGRIGYL AGPTEAPKWR VSFKELKNNK 

251 SRNGFSNMEG AAKQKYSSFI KEVQKGNAPQ TAAKSIGTAS GSNLEKLPNN 

301 LYSVRLSQKD RVTFTQNDTD NTMTVHSVGT HYKNI 



FIGURE 5 
17 ATGGTTATTAAACCCGTAACAACTCCGAGTGTAATACAATTAACGCCTGA 66
172 ATGGT 176

67 TGATAGAGTAACGCCTGATGATAAAGGTGAATATCAACCCGTTGAAAAGC 116
177 TATACAATTAACACCTGATGATAGAAGTGGATATCCACCCGTTGAAAAGC 226

117 AAATAGCGGGAGATATAATACGTGTACTAGAATTCAAGCAAACAAATGAA 166
227 AAATAGCAGGAGATATAGTACGTATACTAAACTTTAAGCAAACAGATGAG 276

167 AGTCATACAGGATTGTATGGAATTGCATATCGAGCTAAGAAAGTAATAAT 216
277 GGTCATACAGCATCATATGGAATTGAATATCGAGCTAAGAAAATAATATT 326

217 AGCATATGCTTTAGCGGTAAGTGGTATTCATAATGTCTCTCAACTTCCAG 266
327 AGCTTACGCTTTGGCTGTAAGTGGTATTCATAATGTATCTAAACTTCCTG 376

267 AAGACTATTATAAAAATAAGGATAACACAGGTAGAATTTATCAAGAATAC 316
377 ATGACTATTATAAGAATAAAGAGACTGCTGAGAGAATTTATCAAGAATAT 426

317 ATGTCTAATCTTTTATCTGCACTATTGGGTGAGAATGGTGATCAAATTTC 366
427 ATGTCTAATCTTTCATCTGCACTATTAGGTGAAAATGGTGATCAAATTTC 476

367 TAAAGATATGGCAAATGATTTTACCCAGAACGAACTGGAGTTTGGAGGTC 416
477 TAAAGATATGGCAAATGGTTTTTATAAGAATGAACTGGATTTTGAAGGTC 526

417 AACGTCTTAAAAATACCTGGGATATTCCTGATCTTGAGAATAAACTATTG 466
527 AATATCCTCAAAACATTTGGAATGTTCCTGAGCTTGAAAATAAACCATTG 576

467 GAAGATTATTCAGATGAAGATAAATTATTAGCACTATATTTCTTTGCTTC 516
577 AGTGCTTATTCAGATGACGATAAATTATTAGCACTATATTTTTTCTCTGT 626

517 ACAAGAACTTCCAATGGAGGCAAATCAACAATCAAATGCAGCAAATTTTT 566
627 ACAGGAAATTCCACTGGAGGAAAATCAACAATCAAATGCCGCAAGATTTT 676

567 TTAAAGTAATTGATTTTTTACTTATCTTATCTGCTGTAACATCACTGGGA 616
677 TTAAATTAATTGATTTCTTATTTACCTTATCTGCTGTAACTTCACTGGGA 726

617 AAAAGGATTTTTTCAAAAAATTTTTACAATGGTCTAGAAACTAAATCATT 666
727 AGGAGGATTTTTTCAAAAAACTTTTACAATGGATTAGAGGCTAAATCATT 776

667 AGAGAATTATATTGAGAGAAAAAAACTTTCTAAACCTTTCTTTCGACCAC 716
777 AGAGAATTATATTGAGAGAAAAAAACTTTCTAAACCTTTCTTTCGACCAC 826

717 CGCAGAAGTTACCTGATGGCAGAACAGGCTACTTGGCCGGTCCAACAAAA 766
827 CGCAGAGATTACCTGATGGCAGAATAGGTTATTTGGCTGGACCAACAGAA 876
767 GCGCCTAAATTGCCAACAACGTCTTCTACAGCAACAACGTCTACAGCAGC 816
877 GCGCCTAAA 885

817 TTCATCTAATTGGAGAGTTAGTTTGCAAAAACTTAGAGATAACCCATCCA 866
886 TGGAGAGTGAGTTTTAAAGAACTTAAAAATAACAAATCTA 925

867 GAAATACATTTATGAAAATGGATGATGCTGCAAAACGAAAATATAGTTCA 916
926 GGAATGGATTTTCTAATATGGAAGGGGCTGCAAAACAAAAGTATAGTTCA 975

917 TTTATAAAAGAGGTACAAAAGGGTAATGATCCACGTGCAGCAGCAGCAAG 966
976 TTTATAAAAGAGGTACAAAAGGGTAACGCTCCACAGACAGCAGCGAAAAG 1025

967 TATTGGTACAAAAAGCGGCAGTAACTTCGAAAAACTGCAAGGTAGAGATT 1016
1026 TATTGGTACAGCCAGTGGCAGTAACCTGGAAAAATTGCCGAATA...ATT 1072

1017 TATATAGTATAAGACTAAGCCAAGAACACAGGGTAACATTCTCCATAAAT 1066
1073 TATATAGTGTGAGGCTAAGCCAAAAAGACAGGGTAACCTTTACTCAAAAT 1122

1067 AATACTGACCAAATAATGGAGATCCAAAGTGTTGGAACTCATTACCAAAA 1116
1123 GATACTGACAATACAATGACGGTTCATAGTGTTGGAACTCATTATAAAAA 1172

1117 TATA... 1120
1173 TATATGA 1179
1 MVIKPVTTPSVIQLTPDDRVTPDDKGEYQPVEKQIAGDIIRVLEFKQTNE 50
1 MVIQLTPDDRSGYPPVEKQIAGDIVRILNFKQTDE 35

51 SHTGLYGIAYRAKKVIIAYALAVSGIHNVSQLPEDYYKNKDNTGRIYQEY 100
36 GHTASYGIEYRAKKIILAYALAVSGIHNVSKLPDDYYKNKETAERIYQEY 85

101 MSNLLSALLGENGDQISKDMANDFTQNELEFGGQRLKNTWDIPDLENKLL 150
86 MSNLSSALLGENGDQISKDMANGFYKNELDFEGQYPQNIWNVPELENKPL 135

151 EDYSDEDKLLALYFFASQELPMEANQQSNAANFFKVIDFLLILSAVTSLG 200
136 SAYSDDDKLLALYFFSVQEIPLEENQQSNAARFFKLIDFLFTLSAVTSLG 185

201 KRIFSKNFYNGLETKSLENYIERKKLSKPFFRPPQKLPDGRTGYLAGPTK 250
186 RRIFSKNFYNGLEAKSLENYIERKKLSKPFFRPPQRLPDGRIGYLAGPTE 235

251 APKLPTTSSTATTSTAASSNWRVSLQKLRDNPSRNTFMKMDDAAKRKYSS 300
236 APK.................WRVSFKELKNNKSRNGFSNMEGAAKQKYSS 268

301 FIKEVQKGNDPRAAAASIGTKSGSNFEKLQGRDLYSIRLSQEHRVTFSIN 350
269 FIKEVQKGNAPQTAAKSIGTASGSNLEKLPN.NLYSVRLSQKDRVTFTQN 317

351 NTDQIMEIQSVGTHYQNI 368
318 DTDNTMTVHSVGTHYKNI 335
PCR amplification with
proof-reading polymerase

(diagram labels: Nco I)

digest amplicon with Nco I and Sap I
and clone into pCYB3 vector

(diagram labels: toxin coding, intein coding, CBD coding)

transform into E. coli and induce
expression of fusion protein

(diagram labels: intein, Met, CBD, Ile|Cys, site for intein self-splicing)

affinity purify fusion protein on chitin cellulose

induce on-column intein cleavage reaction
elute free toxin